We propose a method of detecting a phase transition in a generalized Polya urn in an
information cascade experiment. The method is based on the asymptotic behavior of the
correlation $C(t)$ between the first subject's choice and the $(t+1)$-th subject's choice,
the limit value of which, $c = \lim_{t\to\infty} C(t)$, is the order parameter of the phase
transition. To verify the method, we perform a voting experiment using two-choice questions.
An urn X is chosen at random from two urns A and B, which contain red and blue balls in
different configurations. Subjects sequentially guess whether X is A or B using information
about the prior subjects' choices and the color of a ball randomly drawn from X. The color
tells the subject which is X with probability $q$. We set $q \in \{5/9, 6/9, 7/9, 8/9\}$ by
controlling the configurations of red and blue balls in A and B. The (average) lengths of the
sequence of the subjects are 63, 63, 54.0, and 60.5 for $q \in \{5/9, 6/9, 7/9, 8/9\}$,
respectively. We describe the sequential voting process by a nonlinear Polya urn model. The
model suggests the possibility of a phase transition when $q$ changes. We show that
$c > 0$ $(= 0)$ for $q = 5/9, 6/9$ $(7/9, 8/9)$ and detect the phase transition using the
proposed method.
1. Introduction

The social contagion process has long been extensively studied. 1-3 Because of progress
in information communication technology, we often rely on social information for decision
making. 4-6 The Polya urn is a simple stochastic process in which contagion is taken into
account by a reinforcement mechanism. 7 There are initially $R_0$ red balls and $B_0$ blue
balls in an urn. At each step, one draws a ball randomly from the urn and duplicates it. Then,
one returns both balls, so the probability of selecting a ball of the same color is
strengthened. As the process is repeated infinitely, the ratio of red balls in the urn, $z$,
becomes random and obeys the beta distribution $\beta(R_0, B_0)$. In the process, information
on the first draw propagates and affects infinitely later draws. The correlation between the
color of the first ball and that of a ball chosen later is $1/(R_0 + B_0 + 1)$. 8
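
The value $1/(R_0 + B_0 + 1)$ quoted above can be checked with a short exact calculation. The following is a minimal sketch, not taken from the paper: the function name `polya_first_draw_correlation` and the use of exact fractions are our own illustrative choices. It computes the difference between the probabilities of drawing red at draw $t+1$ given that the first draw was red or blue.

```python
from fractions import Fraction


def polya_first_draw_correlation(r0: int, b0: int, t: int) -> Fraction:
    """Exact Pr(draw t+1 red | draw 1 red) - Pr(draw t+1 red | draw 1 blue)."""

    def prob_red_after(n_draws: int, red: int, blue: int) -> Fraction:
        # dist maps the number of red balls to its probability.
        dist = {red: Fraction(1)}
        total = red + blue
        for _ in range(n_draws):
            new = {}
            for r, p in dist.items():
                p_red = Fraction(r, total)
                new[r + 1] = new.get(r + 1, 0) + p * p_red
                new[r] = new.get(r, 0) + p * (1 - p_red)
            dist = new
            total += 1  # every draw adds one ball to the urn
        return sum(p * Fraction(r, total) for r, p in dist.items())

    # After the first draw the urn holds one extra ball of the drawn color;
    # t - 1 further draws take place before draw number t + 1.
    return prob_red_after(t - 1, r0 + 1, b0) - prob_red_after(t - 1, r0, b0 + 1)
```

For any $t \ge 1$ the result does not depend on $t$, which is the sense in which the first draw affects infinitely later draws.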

As the Polya urn process is very simple, and there are many reinforcement phenomena
in nature and the social environment, many variants of the process have been proposed
under the name of generalized Polya urn. 9 One example is the lock-in phenomenon proposed
by Arthur as a mechanism by which a technology, product, or service dominates others and
occupies a large market share. 10 The dominant one is not necessarily superior to the others
in some respect. The necessary condition for lock-in is externality, in which wider adoption
induces posterior superiority. Arthur used a generalized Polya urn to explain the lock-in
phenomenon. In the process, the choice of the ball (technology, product, or service) is
described by a nonlinear function $f(z)$ of the ratio of red balls $z$. In contrast to the
original Polya urn, where $f(z) = z$, the ratio of red balls converges to a stable fixed point
$z_* = f(z_*)$ in the nonlinear model. 11 Mathematically, the fixed points $z_*$ are
categorized as upcrossings and downcrossings, at which the graph $y = f(z)$ crosses the graph
$y = z$ going upward and downward, respectively. The downcrossing (upcrossing) fixed point is
stable (unstable), as the probability that $z$ converges to it is positive (zero). Arthur
adopted an S-shaped $f(z)$ with two stable fixed points and noted that random selection among
the fixed points also occurs in the adoption process.
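
The classification into downcrossings (stable) and upcrossings (unstable) can be illustrated numerically. The sketch below is only an illustration under stated assumptions: the S-shaped function `response` (a hyperbolic tangent with steepness `k`) is hypothetical and is not the response function of the paper, and the grid-plus-bisection search is one simple way to locate the crossings.

```python
import math


def response(z: float, k: float = 4.0) -> float:
    """Hypothetical S-shaped f(z); chosen only for illustration."""
    return 0.5 * (1.0 + math.tanh(k * (z - 0.5)))


def fixed_points(f, n_grid: int = 1001, tol: float = 1e-12):
    """Find crossings of y = f(z) with y = z on [0, 1] and label stability.

    A downcrossing (f(z) - z changes from + to -) is labeled "stable",
    an upcrossing (from - to +) is labeled "unstable".
    """
    g = lambda z: f(z) - z
    found = []
    for i in range(n_grid):
        a, b = i / n_grid, (i + 1) / n_grid
        ga, gb = g(a), g(b)
        if ga * gb >= 0:
            continue  # no sign change inside this grid cell
        downcrossing = ga > 0
        while b - a > tol:  # bisection, keeping the sign of g at a fixed
            m = 0.5 * (a + b)
            if (g(m) > 0) == (ga > 0):
                a = m
            else:
                b = m
        found.append((0.5 * (a + b), "stable" if downcrossing else "unstable"))
    return found
```

With the steep choice `k = 4.0` the sketch reports two stable fixed points separated by an unstable one; with a shallow choice such as `k = 1.0` only a single stable fixed point remains.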

If the number of stable fixed points changes as one changes the parameters of the function
$f(z)$, the generalized Polya urn shows a transition. 12,13 The order parameter is the limit
value of the correlation between the first drawn ball and later drawn balls. 14,15 If $f(z)$
is $Z_2$-symmetric and satisfies $f(z) = 1 - f(1 - z)$, the transition becomes continuous, and
the order parameter satisfies a scaling relation in the nonequilibrium phase transition. One
good candidate for experimental realization of the phase transition is the information cascade
experiment. 16 There, participants answer two-choice questions sequentially. In the canonical
setting of the experiment, two urns, A and B, with different configurations of red and blue
balls are prepared. 17-19 One of the two urns is chosen at random to be urn X, and the question
is whether urn X is A or B. The participants can draw a ball from urn X and see which type
of ball it is. This knowledge, which is called the private signal, provides some information
about X. However, the private signal does not indicate the true situation unequivocally, and
participants have to decide under uncertainty. Participants are also provided with social
information regarding how many prior participants have chosen each urn. The social information
introduces an externality to the decision making: as more participants choose urn A (B), later
participants are more likely to identify urn X as urn A (B). The social interaction in which
a participant tends to choose the majority choice even if it contradicts the private signal is
called an information cascade or rational herding. 16 In a simple model of information
cascade, if the difference in the numbers of subjects who have chosen each urn reaches two,
the social information overwhelms subjects' private signals. In the limit of many previous
subjects, the decision is described by a threshold rule stating that a subject chooses an option
if its ratio exceeds 1/2, $f(z) = \theta(z - 1/2)$. The function $f(z)$ that describes
decisions under social information is called a response function. 20

To detect the phase transition caused by the change in $f(z)$, we have proposed another
information cascade experiment in which subjects answer two-choice general knowledge
questions. 21,22 If almost all of the subjects know the answer to a question, the probability
of the correct choice is high, and $f(z)$ does not depend greatly on the social information.
In this case, $f(z)$ has only one stable fixed point. However, when almost all the subjects do
not know the answer, they show a strong tendency to choose the majority answer. Then $f(z)$
becomes S-shaped, and it could have multiple stable fixed points. We have shown that when the
difficulty of the questions is changed, the number of stable fixed points of the experimentally
derived $f(z)$ changes. 21 If the questions are easy, there is only one stable fixed point,
$z_+$, and the ratio of the correct choice $z$ converges to that value. If the questions are
difficult, two stable fixed points, $z_+$ and $z_-$, appear. The stable fixed point to which
$z$ converges becomes random. To detect the randomness using experimental data, we studied how
the variance of $z$ changes as more subjects answer questions of fixed difficulty. We showed
that the variance converges to zero in the limit of many subjects for easy questions. For
difficult questions, it converges to a finite and positive value, which suggests the existence
of multiple stable states in the system.

In this paper, we propose a new method of detecting the phase transition of a nonlinear
Polya urn in an information cascade experiment. It is based on the asymptotic behavior of the
correlation function and the estimation of its limit value. We perform an information cascade
experiment to verify our method. We adopt the canonical setting for an information cascade
experiment, in which subjects guess whether urn X is urn A or urn B. In the proceedings of
ECCS'14, we reported some results from the present experiment. 23 Here, we provide complete
information about the proposed method and the results of analysis of the experimental
data.

The paper is organized as follows. Section 2 considers a simple model of information
cascade. We estimate the correlation function and the order parameter. In Sect. 3, we explain
the experimental procedure. Section 4 presents the analysis of the experimental data. We
propose a nonlinear Polya urn model based on the empirically estimated response function in
Sect. 5. We estimate the order parameter by extrapolating the experimental results to a larger
system. We show the possibility of the phase transition in the thermodynamic limit. Section
6 presents a summary and future problems. Appendices provide additional information about
the experiments.

2. Simple Model of Information Cascade 

We study a simple model of information cascade, which is a modification of the "basic
model" in Ref. 16. Assume that there are two options, A and B, one of which is chosen to be
correct with equal probability. Each individual privately observes a conditionally independent
signal about the true option. Individual $t$'s signal, $S_t$, is A or B, and A is observed with
probability $q$ if the true option is A and with probability $1 - q$ if the true option is B.
Each individual also observes the decisions of all those ahead of him. Without loss of
generality, we label the correct (incorrect) option as 1 (0), and $S_t \in \{0, 1\}$. The
probability that $S_t = 1$ is $q$.

We assume that the first individual chooses 1 (0) if his private signal is 1 (0). The second
individual can infer the first individual's signal from his decision. If the first individual chose
1 (0), the second individual chooses 1 (0) if his signal is 1 (0). If his signal contradicts the first
individual's choice, we assume he chooses the same option as his signal, which is different
from the tie-breaking convention in the "basic model", 16 where the individual chooses 1 or 0
with equal probability. There are three situations for the third individual: (1) Both
predecessors have chosen 1. Then, irrespective of his signal, he chooses 1. The following
individuals also choose 1, and a correct cascade, which is called an up cascade in Ref. 16,
starts. (2) Both have chosen 0, and an incorrect cascade, or down cascade, starts. (3) One has
chosen 1, and the other has chosen 0. The third individual is in the same situation as the
first individual, and he chooses the option matching his signal. The probability that both of
the first two individuals receive correct (incorrect) signals is $q^2$ $((1 - q)^2)$, so an up
(down) cascade starts with probability $q^2$ $((1 - q)^2)$.

Fig. 1. Simple model of information cascade. States $N(t) \in \{2, 1, 0, -1, -2\}$ and probabilities for $X(t) \in \{0, 1\}$.


We denote the difference in the number of correct and incorrect choices up to the $t$-th
individual as $N(t)$. From the above discussion, if $N(t) \ge 2$ $(\le -2)$, an up (down)
cascade starts. There are essentially five states, $N(t) \in \{-2, -1, 0, 1, 2\}$, if we
identify all states with $N(t) \ge 2$ $(\le -2)$ as $N(t) = 2$ $(-2)$. If $t$ is even, there
are three states, $N(t) \in \{-2, 0, 2\}$, and there are four states,
$N(t) \in \{-2, -1, 1, 2\}$, if $t$ is odd. Figure 1 illustrates the model. In the figure,
we also show the probabilistic rule for the transition between states. At $t = 0$, $N(0) = 0$,
and it jumps to $N(1) = 1$ $(-1)$ with probability $q$ $(1 - q)$. From $t = 1$ to $t = 2$, the
same rule applies, and $N(t)$ increases (decreases) by 1 with probability $q$ $(1 - q)$. If
$N(t) = 2$ $(-2)$ at $t = 2$, an up (down) cascade starts. Later individuals choose 1 (0) for
$t \ge 3$, and $N(t)$ remains 2 $(-2)$. If $N(t) = 0$ at $t = 2$, the third individual chooses
1 with probability $q$. In general, if $|N(t)| \le 1$, $N(t)$ increases (decreases) by 1 with
probability $q$ $(1 - q)$. The problem is a random walk model with absorbing walls at
$N(t) = \pm 2$. As $t$ increases, the probability that the random walk is absorbed in the
walls increases. In the limit $t \to \infty$, all random walks are absorbed in the walls. A
walk in the state $N(t) = 0$ for even $t$ is eventually absorbed into the state $2$ with
probability $q^2/(q^2 + (1 - q)^2)$ and into the state $-2$ with probability
$(1 - q)^2/(q^2 + (1 - q)^2)$. The probability for an up cascade in the limit
$t \to \infty$, which we denote by $P_2(\infty)$, is then given as

$$P_2(\infty) \equiv \Pr(N(\infty) = 2) = \frac{q^2}{q^2 + (1 - q)^2}. \qquad (1)$$

In the up (down) cascade, individuals always choose 1 (0), and $P_2(\infty)$ is the limit
value for the probability of the correct choice. It is greater than $q$ for $q > 1/2$, and the
deviation shows an increase in the accuracy from that of the signal. $P_2(\infty) - q$ is a
measure of the collective intelligence.
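
Equation (1) can be cross-checked by iterating the five-state random walk directly. The following is a minimal sketch under our own naming (`up_cascade_probability` and `p2_infinity` are illustrative helpers, not part of the paper): it propagates the probability distribution of $N(t)$ with absorbing walls at $\pm 2$ and compares the absorbed mass at $+2$ with the closed form.

```python
def up_cascade_probability(q: float, t_max: int = 200) -> float:
    """Pr(N(t_max) = 2) for the walk with absorbing walls at N = +2 and N = -2."""
    probs = {0: 1.0}  # N(0) = 0
    for _ in range(t_max):
        nxt = {}
        for n, p in probs.items():
            if abs(n) >= 2:
                # Inside a cascade the state no longer changes.
                nxt[n] = nxt.get(n, 0.0) + p
            else:
                # Otherwise the individual follows the private signal.
                nxt[n + 1] = nxt.get(n + 1, 0.0) + p * q
                nxt[n - 1] = nxt.get(n - 1, 0.0) + p * (1.0 - q)
        probs = nxt
    return probs.get(2, 0.0)


def p2_infinity(q: float) -> float:
    """Closed form of Eq. (1)."""
    return q**2 / (q**2 + (1.0 - q) ** 2)
```

Because the probability of remaining unabsorbed shrinks by a factor $2q(1-q) \le 1/2$ every two steps, a few hundred steps are more than enough for the comparison.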

We denote the $t$-th individual's choice as $X(t) \in \{0, 1\}$. We are interested in the
estimation of the correlation function $C(t)$, which is defined as the covariance of $X(1)$
and $X(t + 1)$ divided by the variance of $X(1)$. $C(t)$ can also be defined as the difference
in the conditional probabilities:

$$C(t) = \Pr(X(t + 1) = 1 \mid X(1) = 1) - \Pr(X(t + 1) = 1 \mid X(1) = 0).$$


$C(t)$ is then estimated as

$$C(2n) = c(q) + \frac{(1 - 2q)^2}{2\left(q^2 + (1 - q)^2\right)} \left(\sqrt{2q(1 - q)}\right)^{2n},$$

$$C(2n + 1) = C(2n),$$

$$c(q) = \lim_{t\to\infty} C(t) = \frac{q(1 - q)}{q^2 + (1 - q)^2}. \qquad (2)$$

The derivation of $C(t)$ is given in Appendix A. The limit value
$c(q) = \lim_{t\to\infty} C(t)$ is the order parameter of the phase transition in a nonlinear
Polya urn. The order parameter $c(q)$ changes continuously with $q$, and it vanishes at
$q = 0$ and $q = 1$. The simple model does not show a phase transition, and $C(t)$ decays
exponentially with $t$.
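
As a consistency check of the closed form for $C(2n)$ and $c(q)$, one can compute $C(t)$ exactly from the five-state walk of Sect. 2 and compare. This is a sketch with hypothetical helper names (`choice_probability`, `c_exact`, `c_formula`); it assumes $n \ge 1$, i.e., it compares values for $t \ge 2$.

```python
def choice_probability(q: float, t: int, first: int) -> float:
    """Pr(X(t+1) = 1 | X(1) = first) in the simple cascade model, for t >= 1."""
    probs = {1 if first == 1 else -1: 1.0}  # distribution of N(1)
    for _ in range(t - 1):                  # advance N(1) to N(t)
        nxt = {}
        for n, p in probs.items():
            if abs(n) >= 2:                 # cascade: state is frozen
                nxt[n] = nxt.get(n, 0.0) + p
            else:                           # choice follows the private signal
                nxt[n + 1] = nxt.get(n + 1, 0.0) + p * q
                nxt[n - 1] = nxt.get(n - 1, 0.0) + p * (1.0 - q)
        probs = nxt
    total = 0.0
    for n, p in probs.items():
        if n >= 2:
            total += p          # up cascade: always chooses 1
        elif n > -2:
            total += p * q      # no cascade: chooses 1 with probability q
    return total


def c_exact(q: float, t: int) -> float:
    return choice_probability(q, t, 1) - choice_probability(q, t, 0)


def c_formula(q: float, n: int) -> float:
    """C(2n) from the closed form, with the limit c(q) as the first term."""
    denom = q**2 + (1.0 - q) ** 2
    c_limit = q * (1.0 - q) / denom
    return c_limit + (1.0 - 2.0 * q) ** 2 / (2.0 * denom) * (2.0 * q * (1.0 - q)) ** n
```

The correction term decays as $(2q(1-q))^n$, which is the exponential decay mentioned in the text.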


3. Experimental Setup 

The experiments reported here were conducted at Kitasato University. We performed two
experiments, EXP-I and EXP-II. In EXP-I (II), we recruited $|ID| = 307$ (33) students, mainly
from the School of Science. In EXP-I (II), we prepared $I = 200$ (33) questions for
$q \in Q = \{5/9, 6/9, 7/9\}$ $(\{8/15, 5/9, 6/9\})$ and $I = 400$ questions for $q = 8/9$.
EXP-I was performed during three periods, $q \in \{5/9, 6/9\}$ in 2013, $q = 7/9$ in 2014, and
$q = 8/9$ in 2015. EXP-II was performed in 2011. We label the questions as
$i = 1, 2, \cdots, I$. Subjects answered $I/2$ ($I$) questions for some (all) values of $q$ in
$Q$ in EXP-I (II). We obtained $I$ sequences of answers of length $T = 63$ (33) for
$q = 5/9, 6/9$ $(8/15, 5/9, 6/9)$ in EXP-I (II). In EXP-I for $q = 7/9$ and $q = 8/9$, some
subjects could not answer $I/2$ questions within the allotted time. The length $T$ of the
sequence depends on $i$, and the average (minimum) length $T_{avg}$ ($T_{min}$) is 54.0 (49)
for $q = 7/9$ and 60.5 (58) for $q = 8/9$.

The $|ID|$ subjects sequentially answered a two-choice question and received returns for each
correct choice. We prepared $I$ questions for each $q \in Q$ by randomly choosing an urn from
two different urns, urn A and urn B, which contain ball A (red) and ball B (blue) in different
proportions. We denote the answer to question $q \in Q$, $i \in \{1, \cdots, I\}$ as
$U(q, i) \in \{A, B\}$. For $q = n/m > 1/2$, urn A (B) contains $n$ A (B) balls and $m - n$ B
(A) balls. Urn A (B) contains more A (B) balls than B (A) balls. The subjects obtain
information about urn X by knowing the color of a ball randomly drawn from it. The color of the
ball is the private signal, as it is not shared with other subjects. If the ball is ball A (B),
X is more likely to be A (B). Further, $q$ is the posterior probability that the randomly
chosen ball suggests the correct urn and the private signal is correct. We prepared the private
signal $S(q, i, t) \in \{A, B\}$ for $T$ subjects and $I$ questions in advance. In EXP-I, we
controlled the ratio of the correct signal so that it was precisely $q$. Among $T$ subjects,
exactly $q \cdot T$ subjects received the correct signal. In EXP-II, we did not control the
private signal. Among 33 subjects, $q \cdot 33$ subjects received the correct signal on
average. Table I summarizes the design.
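
The two ways of preparing private signals can be sketched as follows. This is an illustration only: the function names, the encoding 1 = correct and 0 = incorrect, and the use of a random shuffle to order the signals are our assumptions; the text states only that exactly $q \cdot T$ signals were correct in EXP-I and that the signals were not controlled in EXP-II.

```python
import random


def controlled_signals(q_num: int, q_den: int, length: int, rng: random.Random):
    """EXP-I style: exactly q * length correct signals, in random order."""
    if (q_num * length) % q_den != 0:
        raise ValueError("q * length must be an integer")
    n_correct = q_num * length // q_den
    signals = [1] * n_correct + [0] * (length - n_correct)
    rng.shuffle(signals)  # assumed ordering; not specified in the text
    return signals


def uncontrolled_signals(q: float, length: int, rng: random.Random):
    """EXP-II style: each signal is independently correct with probability q."""
    return [1 if rng.random() < q else 0 for _ in range(length)]
```

For example, with $q = 5/9$ and $T = 63$ the controlled version always contains 35 correct signals, whereas the uncontrolled version contains $q \cdot T$ correct signals only on average.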


Table I. Experimental design. $|ID|$, number of subjects; $T$, length of private signal; $T_{avg}$, average length of
subject sequence; $T_{min}$, minimum length of subject sequence; $\{q\}$, precision of private signal; $I$, number of
questions.

Experiment            |ID|   T    T_avg   T_min   {q}                I
I (2013.9-2013.10)    126    63   63      63      {5/9, 6/9}         200
I (2014.12)           109    63   54.0    49      7/9                200
I (2015.9)            121    63   60.5    58      8/9                400
II (2011.1)           33     33   33      33      {8/15, 5/9, 6/9}   33


Subjects answered the questions individually using their respective private signals and
information about the previous subjects' choices. This information, called social information,
was given as the summary statistics of the previous subjects. If the subject answers question
$q, i$ after $t - 1$ subjects, the subject receives a private signal $S(q, i, t)$ and social
information $\{C_A(q, i, t - 1), C_B(q, i, t - 1)\}$ from the previous $t - 1$ subjects. Let
$X(q, i, s) \in \{A, B\}$ be the $s$-th subject's choice; the social information
$C_x(q, i, t - 1)$, $x \in \{A, B\}$, is written as

$$C_x(q, i, t - 1) = \sum_{s=1}^{t-1} \delta_{X(q,i,s),x},$$

where $C_A(q, i, t - 1) + C_B(q, i, t - 1) = t - 1$ holds.
[Figure 2: screenshot of the "Cascade Experiment" answer screen, showing the number of questions answered so far, the subject's ball color, the prompt "Which type, A or B?", and a request to state the confidence in the choice.]

Fig. 2. Snapshot of the screen for $q = 6/9 = 2/3$ in EXP-I. The private signal is shown on the second line.
The summary statistics $\{C_A(t), C_B(t)\}$ appear in the second row in the box.


Figure 2 illustrates the experience of subjects in EXP-I more concretely. The second line
shows the subject's private signal. The figure below the question shows the type of question,
$q$. Before the experiment, the experimenter described the ball configuration in urns A and B
and explained how the signal is related to the likelihood for each urn. The subjects can recall
the question by looking at the figure. In the second row of the box, the social information
is provided. In the screenshot shown in the figure, four subjects have already answered the
question. Three of them have chosen urn A, and one has chosen urn B. The subject chooses
urn A or urn B using the radio buttons in the last row of the box. They were asked to choose by
stating how confident they are about their answer, that is, to choose 100% if they were certain
about their choice and to choose 50% if they were not at all confident about their choice.
The reward for the correct choice does not depend on the confidence level. Irrespective of the
degree of confidence, subjects receive a positive return for the correct choice. After they
chose an option and pressed the answer button, we let them know the correct choice on the next
screen. In EXP-II, the subjects were asked to choose urn A or urn B, and they were not asked to
state their degree of confidence. In addition, we did not let them know the correct option. We
only told them their total reward. For more details about the experimental procedure, please
refer to the appendices.

Hereafter, instead of A and B, we use 1 and 0 to describe the correct and incorrect
choices and private signal as in the previous section. We use the same notation for them,
as follows: $S(q, i, t) \in \{0, 1\}$ and $X(q, i, t) \in \{0, 1\}$. For the social
information, we define $\{C_1(q, i, t), C_0(q, i, t)\}$ as
$C_1(q, i, t) = C_{U(q,i)}(q, i, t)$ and $C_0(q, i, t) = t - C_1(q, i, t)$. Further,
$C_1(q, i, t)$ shows the number of correct choices up to the $t$-th subject for question
$q \in Q$, $i \in \{1, \cdots, I\}$. In EXP-I, the length of $\{X(q, i, t)\}$ and
$\{S(q, i, t)\}$ depends on $i$ for $q = 7/9$ and $8/9$, and one should write its dependence on
$i$ explicitly as $T(q, i)$. For simplicity, we use $T$ whenever it will not cause confusion.
For example, we denote the ratio of correct choices up to the $t$-th subject for question
$q, i$ as $Z(q, i, t)$:

$$Z(q, i, t) = \frac{1}{t} \sum_{s=1}^{t} X(q, i, s).$$

We write the final value $Z(q, i, T(q, i))$ as $Z(q, i, T)$.
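
The bookkeeping of the social information $\{C_A, C_B\}$ and of the running ratio $Z(q, i, t)$ for a single question can be sketched as below. The function names and the list-based data layout are hypothetical and serve only to make the definitions concrete.

```python
def social_information(choices, t):
    """Counts (C_A, C_B) shown to the t-th subject, from the first t - 1 choices."""
    prior = choices[: t - 1]
    c_a = sum(1 for x in prior if x == "A")
    return c_a, len(prior) - c_a


def running_ratio(correct_flags):
    """Z(t) = (1/t) * sum_{s=1}^{t} X(s) for t = 1..T, with X(s) in {0, 1}."""
    ratios, total = [], 0
    for t, x in enumerate(correct_flags, start=1):
        total += x
        ratios.append(total / t)
    return ratios
```

With the choices A, A, B, A, the fifth subject sees the counts (3, 1), which is the situation described for the screenshot in Figure 2.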

4. Data Analysis 

In this section, we show the results of the analysis of the experimental data. We describe 
how the social information and private signal affect subjects’ decisions. 


4.1 Distribution of Z(q,i,T) 

We study the relationship between the precision of the signal $q$ and $Z(q, i, T_{min})$. As
we are interested in the dependence on the initial value of $X(q, i, 1)$, we divide the
samples according to the value of $X(q, i, 1) = x$. We denote the sample number and the
average value of $Z(q, i, t)$ for each case $X(q, i, 1) = x$ as $I(q|x)$ and
$Z_{avg}(q, t|x)$, respectively.

$$I(q|x) = \sum_{i=1}^{I} \delta_{X(q,i,1),x},$$

$$Z_{avg}(q, t|x) = \frac{\sum_{i=1}^{I} Z(q, i, t)\, \delta_{X(q,i,1),x}}{I(q|x)}.$$

The unconditional average value $Z_{avg}(q, t)$ is then given as

$$Z_{avg}(q, t) = q \cdot Z_{avg}(q, t|1) + (1 - q) \cdot Z_{avg}(q, t|0). \qquad (3)$$

$Z_{avg}(q, t)$ corresponds to $P_2(t)$ in the simple model, and the deviation of
$Z_{avg}(q, t)$ from $q$ is a measure of the collective intelligence.

Figure 3 shows boxplots of $Z(q, i, T_{min})$ for the samples with
$X(q, i, 1) = x \in \{0, 1\}$. From left to right, $q$ increases. When $q$ is small,
$Z_{avg}(q, T_{min}|x)$ is small. The distribution of $Z(q, i, T_{min})$ also depends on the
initial value $X(q, i, 1) = x$. For $q = 8/9$ in EXP-I, all $Z(q, i, T_{min})$ are larger than
one-half for $x = 1$. This suggests that $Z(q, i, t)$ converges to almost 1 as $t$ increases.
On the other hand, if $x = 0$ for $q = 8/9$, there are some samples with
$Z(q, i, T_{min}) < 1/2$. We cannot judge whether all $Z(q, i, t)$ converge to almost 1 in the
limit $t \to \infty$. If $x = 0$ with $q \in \{5/9, 6/9\}$, the distribution of
$Z(q, i, T_{min})$ is wide, suggesting the existence of multiple fixed points to which
$Z(q, i, t)$ converges.
Fig. 3. Boxplot of $Z(q, i, T_{min})$ in EXP-I (left) and EXP-II (right).

Fig. 4. Plot of $Z_{avg}(q, T_{min})$ and $P_2(\infty)$ vs. $q$. $P_2(\infty)$ is given by Eq. (1).


Figure 4 plots $Z_{avg}(q, T_{min})$ and $P_2(\infty)$ in Eq. (1) as a function of $q$. One can
clearly see the collective intelligence effect, as $Z_{avg}(q, T_{min}) - q$ is positive in
almost all cases. For $q = 8/15$ in EXP-II, the number of samples is limited and the difference
is small, so there is no significant difference. One also sees that $P_2(\infty)$ in Eq. (1)
describes $Z_{avg}(q, T_{min})$ relatively well. However, this does not mean that the
experiment should be described by the simple model. As we shall see below, the system shows a
phase transition, and the simple model is essentially wrong.

4.2 Strength of social influence and private signal 

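To measure how strongly the social information and private signal affected subjects'
decision making, we compare the correlation coefficients between them and the subjects'
decisions. We estimate the correlation coefficients as

$$\mathrm{Cor}(S(t), X(t)) = \frac{\overline{X(t) S(t)} - \overline{X(t)} \cdot \overline{S(t)}}{\sqrt{V(X(t))\, V(S(t))}},$$

$$\mathrm{Cor}(Z(t - 1), X(t)) = \frac{\overline{X(t) Z(t - 1)} - \overline{X(t)} \cdot \overline{Z(t - 1)}}{\sqrt{V(X(t))\, V(Z(t - 1))}},$$

$$\overline{A(t)} = \frac{1}{I} \sum_{i=1}^{I} A(q, i, t),$$

$$V(A(t)) = \overline{A^2(t)} - \overline{A(t)}^2.$$

Here, $\overline{A}$ and $V(A)$ denote the average value and variance of a quantity $A$ over
the questions.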

Figure 5 shows plots of the correlation coefficients versus $t$. Overall,
$\mathrm{Cor}(S(t), X(t))$ decreases and $\mathrm{Cor}(Z(t - 1), X(t))$ increases with
increasing $t$. In EXP-I, for $q = 5/9$, $\mathrm{Cor}(S(t), X(t))$ starts at very small values
(Figure 5a). We think that subjects were confused at small $q$, and they could not trust their
private signals at small $t$. However, $\mathrm{Cor}(S(t), X(t))$ rapidly increases and behaves
similarly to the other coefficients. At around $t = 15$, the correlation coefficients fluctuate
around certain values. The results suggest that the system becomes stationary for $t > 15$.
$\mathrm{Cor}(S(t), X(t))$ and $\mathrm{Cor}(Z(t - 1), X(t))$ fluctuate around 0.3 and 0.6,
respectively. This indicates that the social influence is stronger than the private signal.
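
The correlation coefficients at a fixed $t$ are plain sample correlations taken over the questions $i$. A minimal sketch follows; the function name `correlation` and the convention of returning NaN for a zero variance are our own choices.

```python
import math


def correlation(a, b):
    """(mean(AB) - mean(A) mean(B)) / sqrt(V(A) V(B)), with V(A) = mean(A^2) - mean(A)^2."""
    n = len(a)
    if n == 0 or n != len(b):
        raise ValueError("need two non-empty samples of equal length")
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    mean_ab = sum(x * y for x, y in zip(a, b)) / n
    var_a = sum(x * x for x in a) / n - mean_a**2
    var_b = sum(y * y for y in b) / n - mean_b**2
    if var_a <= 0.0 or var_b <= 0.0:
        return float("nan")  # coefficient undefined for a constant sample
    return (mean_ab - mean_a * mean_b) / math.sqrt(var_a * var_b)
```

To obtain $\mathrm{Cor}(S(t), X(t))$ at a given $t$, one would pass the lists of $S(q, i, t)$ and $X(q, i, t)$ over all questions $i$; for $\mathrm{Cor}(Z(t-1), X(t))$ the first list is replaced by $Z(q, i, t-1)$.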

4.3 Response functions f(z, s) 

We study how subjects' decisions are affected by the social information and private signal.
We study the probabilities that $X(t + 1)$ takes 1 under the condition that $Z(t) = z$ and
$S(t + 1) = s$. We denote them as

$$f(z, s) = \Pr(X(t + 1) = 1 \mid Z(t) = z, S(t + 1) = s).$$

By symmetry under the transformations $S \leftrightarrow 1 - S$, $X \leftrightarrow 1 - X$,
and $Z \leftrightarrow 1 - Z$, $f(z, s)$ has the $Z_2$ symmetry

$$1 - f(1 - z, 0) = f(z, 1).$$

In the estimation of $f(z, s)$ using experimental data $\{S(q, i, t), X(q, i, t)\}$, we exploit
the symmetry. If $S(q, i, t) = 0$, we replace $(S(q, i, t) = 0, Z(q, i, t - 1), X(q, i, t))$
with $(1 - S(q, i, t), 1 - Z(q, i, t - 1), 1 - X(q, i, t))$ and estimate $f(z, 1)$. Then
$f(z, 0)$ is given as $f(z, 0) = 1 - f(1 - z, 1)$. In addition, as we are interested in the
static behavior of $f(z, s)$, and $\mathrm{Cor}(S(t), X(t))$ and
$\mathrm{Cor}(Z(t - 1), X(t))$ reach their stationary values at $t = 15$, we use data
$\{S(q, i, t), X(q, i, t)\}$ for $t \ge 16$.

Fig. 5. Correlation coefficients $\mathrm{Cor}(S(t), X(t))$ and $\mathrm{Cor}(Z(t - 1), X(t))$ vs. $t$ in (a), (c) EXP-I and (b), (d) EXP-II.

We divide the samples $\{X(q, i, t), S(q, i, t)\}$, $16 \le t \le T$, $i = 1, \cdots, I$,
according to the value of $Z(q, i, t - 1)$. We divide them into 11 bins as
$Z(q, i, t - 1) < 5\%$, $5\% \le Z(q, i, t - 1) < 15\%$, $15\% \le Z(q, i, t - 1) < 25\%$,
$\cdots$, $95\% \le Z(q, i, t - 1)$. We write that sample $(X(q, i, t), S(q, i, t))$ is
included in bin $j \in J = \{1, 2, \cdots, 11\}$ as $i \in j$ and the sample number of bin $j$
as $N(q, j) = \sum_{i \in j} 1$. We denote the average value of $Z(q, i, t - 1)$ in bin $j$ as
$z_j = \sum_{i \in j} Z(q, i, t - 1)/N(q, j)$. After this preparation, we estimate
$f(z_j, 1)$ and its error bar $\Delta f(z_j, 1)$ as

$$f(z_j, 1) = \frac{\sum_{i \in j} X(q, i, t)}{N(q, j)},$$

$$\Delta f(z_j, 1) = \sqrt{\frac{f(z_j, 1)\,(1 - f(z_j, 1))}{N(q, j)}}.$$

Figure 6 shows plots of $f(z_j, 1)$ versus $z_j$. It is clear that $f(z_j, 1)$ are
monotonically increasing functions of $z_j$ in EXP-I. For $q = 5/9, 6/9$, their behaviors are
almost the same. For $q = 7/9, 8/9$, few samples appear in the middle bins, and the error bars
are large. In EXP-II, the sample numbers are smaller than those in EXP-I. We can see a strong
positive dependence on $z_j$.

Fig. 6. Response functions $f(z, 1)$ for $q \in Q$ in (a) EXP-I and (b) EXP-II. $f(z, 1)$ shows the probability that a
subject chooses the correct urn when a fraction $z$ of the previous subjects chose it and the private signal is correct.
5. Detection of Phase Transition

In the previous section, we introduced a response function $f(z, s)$ that describes the
probabilistic behavior of subjects in the experiments. For $z_j \le z \le z_{j+1}$,
$j \in \{1, \cdots, 10\}$, we linearly interpolate $f(z, s)$ between $f(z_j, s)$ and
$f(z_{j+1}, s)$. For $z < z_1$ $(> z_{11})$, we adopt $f(z, s) = f(z_1, s)$
$(f(z_{11}, s))$. As the private signal takes 1 with probability $q$, the probability that the
$(t+1)$-th subject chooses the correct option under the social influence $Z(t) = z$ is
estimated as

$$f(z) \equiv \Pr(X(t + 1) = 1 \mid Z(t) = z) = q \cdot f(z, 1) + (1 - q) \cdot f(z, 0). \qquad (4)$$

We call $f(z)$ the averaged response function. Then the voting process $\{X(t)\}$,
$t = 1, 2, \cdots$, becomes a nonlinear Polya urn process. In this section, we study the model
and verify the possibility of a phase transition.
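
Equation (4), combined with the piecewise-linear treatment of $f(z, 1)$ and the symmetry $f(z, 0) = 1 - f(1 - z, 1)$, can be sketched as below. The helper names are hypothetical, and the numbers used in the usage note are made up for illustration; they are not the experimental estimates.

```python
def interpolate(z, zs, fs):
    """Piecewise-linear f between the points (zs[j], fs[j]); constant outside."""
    if z <= zs[0]:
        return fs[0]
    if z >= zs[-1]:
        return fs[-1]
    for j in range(len(zs) - 1):
        if zs[j] <= z <= zs[j + 1]:
            w = (z - zs[j]) / (zs[j + 1] - zs[j])
            return (1.0 - w) * fs[j] + w * fs[j + 1]
    raise ValueError("zs must be sorted in increasing order")


def averaged_response(z, q, zs, f1):
    """f(z) = q * f(z, 1) + (1 - q) * f(z, 0), with f(z, 0) = 1 - f(1 - z, 1)."""
    f_z1 = interpolate(z, zs, f1)
    f_z0 = 1.0 - interpolate(1.0 - z, zs, f1)
    return q * f_z1 + (1.0 - q) * f_z0
```

For instance, with the made-up points `zs = [0.0, 0.5, 1.0]` and `f1 = [0.2, 0.8, 1.0]`, the call `averaged_response(0.5, 2/3, zs, f1)` evaluates $\tfrac{2}{3} \cdot 0.8 + \tfrac{1}{3} \cdot 0.2$.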

5.1 Number of stable fixed points 

We estimate $f(z)$ using the experimental data for EXP-I. We plot the results in Figure 7.
For $q = 5/9$ (thick solid line in Figure 7a), $f(z)$ crosses the diagonal at three points. The
left and right fixed points are stable, and the middle one is unstable. Further, $z(t)$
converges to the two stable fixed points with positive probability, and the order parameter
$c$ is positive, $c > 0$. For $q = 6/9$ (thin solid line in Figure 7a), $f(z)$ touches the
diagonal. Considering the standard error of $f(z)$, it is difficult to judge whether it is a
touchpoint. However, it strongly suggests that there is another stable fixed point or
touchpoint in addition to the right stable fixed point. For $q = 7/9$ (thin broken line in
Figure 7a), $f(z)$ seems to have only one stable fixed point. However, the departure from the
diagonal is small, and it is difficult to judge whether there is only one stable fixed point or
there are two stable fixed points. For $q = 8/9$ (thick broken line), there is only one stable
fixed point, and $c$ is zero.

5.2 Correlation function C(t)

The order parameter $c$ of the phase transition is defined as the limit value of $C(t)$.
$C(t)$ behaves asymptotically with three parameters, $c$, $c'$, and $l > 0$, as

$$C(t) \simeq c + c' \cdot t^{l-1}. \qquad (5)$$

If there is one stable state, $z_+$, $Z(t)$ converges to $z_+$. The memory of $X(1) = x$ in
$p_x(t + 1)$ disappears, and $c = 0$. $C(t)$ decreases to zero with power-law behavior,
$C(t) \propto t^{l-1}$. The exponent $l$ is given by the slope of $f(z)$ at the stable fixed
point $z_+$ as $l = f'(z_+)$. If there are multiple stable states, $z_- < z_+$, the
probability that $z(t)$ converges to $z_+$ depends on $X(1) = x$. If
$c = \lim_{t\to\infty} (p_1(t + 1) - p_0(t + 1))$ is subtracted from $C(t)$, the remaining
terms also obey a power law as $C(t) - c \propto t^{l-1}$. The exponent $l$ is given by the
larger of $\{f'(z_+), f'(z_-)\}$, as the term with the larger value governs the asymptotic
behavior of $C(t) - c$. 15 If we adopt $f(z)$ in Figure 7a, there are two stable states for
$q = 5/9$ and $q = 6/9$. For $q = 7/9$ and $q = 8/9$, there is only one stable state. This
suggests that a phase transition occurs depending on $q$.

Fig. 7. Plot of $f(z) = \Pr(X(t + 1) = 1 \mid z(t) = z)$ in EXP-I. (a) $t > 15$, $q = 5/9$ (thick solid line), 6/9 (thin solid
line), 7/9 (thin broken line), and 8/9 (thick broken line). (b) $q = 5/9$, $t > 15$ (thick solid line), $q = 8/9$ (thick
broken line), $q \in \{5/9, 8/9\}$, $t > 30$ with symbols. We plot the standard error $\Delta f(z)$ for $q = 8/15$ in (a) and
$t > 30$ in (b).

We study the correlation function $C(t)$. First,
$p_x(t + 1) = \Pr(X(t + 1) = 1 \mid X(1) = x)$ and their error bars $\Delta p_x(t + 1)$ are
estimated from the experimental data $\{X(q, i, t)\}$ as

$$p_x(t + 1) = \frac{1}{N_x(q)} \sum_{i \in I} X(q, i, t + 1)\, \delta_{X(q,i,1),x},$$

$$N_x(q) = \sum_{i \in I} \delta_{X(q,i,1),x},$$

$$\Delta p_x(t + 1) = \sqrt{\frac{p_x(t + 1)\,(1 - p_x(t + 1))}{N_x(q)}}.$$

$C(t)$ is then estimated as

$$C(t) = p_1(t + 1) - p_0(t + 1).$$

The standard error of $C(t)$ is given by

$$\Delta C(t) = \sqrt{\Delta p_1(t + 1)^2 + \Delta p_0(t + 1)^2}. \qquad (6)$$
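
The estimator of $C(t)$ and its standard error in Eq. (6) can be sketched as follows, assuming (as an illustrative data layout, not the paper's) that each question's sequence is stored as a Python list of 0/1 choices with `seq[0]` holding $X(1)$.

```python
import math


def correlation_function(sequences, t):
    """Return (C(t), Delta C(t)) from 0/1 choice sequences.

    seq[t] holds X(t + 1) because the lists are zero-indexed.
    Sequences shorter than t + 1 are skipped.
    """
    stats = {}
    for x in (0, 1):
        later = [seq[t] for seq in sequences if len(seq) > t and seq[0] == x]
        n = len(later)
        if n == 0:
            raise ValueError(f"no sequences with X(1) = {x}")
        p = sum(later) / n
        stats[x] = (p, math.sqrt(p * (1.0 - p) / n))
    c = stats[1][0] - stats[0][0]
    err = math.sqrt(stats[1][1] ** 2 + stats[0][1] ** 2)
    return c, err
```

Because the two conditional samples are disjoint, their errors are combined in quadrature, as in Eq. (6).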




Fig. 8. $C(t)$ vs. $t$ in (a) EXP-I and (b) EXP-II. Error bars are estimated using Eq. (6). To see the behavior of
$C(t)$ clearly, we plot $C(t)$ only at intervals of $\Delta t = 5$ (3) for EXP-I (II). In addition, we shift the data for
$q = 5/9, 6/9$ (8/15) leftward and those for $q = 7/9, 8/9$ (6/9) rightward for EXP-I (II).


Figure 8 shows plots of $C(t)$ for $t < T_{min}$ as a function of $t$ in EXP-I and EXP-II. In
both experiments, the error bars are large. In EXP-I, $C(t)$ fluctuates around 0.25 for
$q \in \{5/9, 6/9\}$. For $q \in \{7/9, 8/9\}$, $C(t)$ decreases and takes small values for
large $t$. However, it is difficult to judge whether $C(t)$ decreases to zero or fluctuates
around some positive values. In EXP-II, in all three cases, $C(t)$ seems to fluctuate around
0.2.

5.3 Estimation of C(t) for t > T_min

As the system size $T_{min}$ in our experiments is very limited, we adopt the Polya urn
process based on Eq. (4) to simulate the system for $t > T_{min}$. We introduce a stochastic
process $\{X(t)\}$, $t \in \{1, 2, 3, \cdots, T\}$. $X(t + 1) \in \{0, 1\}$ is a Bernoulli
random variable, and its probabilistic rule depends on all the previous $\{X(t')\}$,
$t' \in \{1, \cdots, t\}$, through $C_1(t) = \sum_{t'=1}^{t} X(t')$. The probability that
$X(t + 1)$ is 1 for $C_1(t) = n$ is given by $f(n/t)$. We denote the probability function for
$\sum_{t'=1}^{t} X(t') = n$ with an initial condition $X(1) = x$ as $P(t, n|x)$:

$$P(t, n|x) = \Pr\left( \sum_{t'=1}^{t} X(t') = n \,\Big|\, X(1) = x \right).$$

The master equation for $P(t, n|x)$ is

$$P(t + 1, n|x) = f((n - 1)/t) \cdot P(t, n - 1|x) + (1 - f(n/t)) \cdot P(t, n|x). \qquad (7)$$

We use the experimental data from EXP-I as the initial condition for $t = T_{\min}$ (Figure 3). We
solve the master equation recursively and obtain $P(t, n|x)$ for $t \le 10^6$. We estimate $C(t)$ as

$$C(t) = \sum_{n=1}^{t} P(t, n|1) \cdot f(n/t) - \sum_{n=0}^{t-1} P(t, n|0) \cdot f(n/t).$$
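
The recursive solution of the master equation, Eq. (7), and the resulting estimate of $C(t)$ can be sketched as follows. This is a minimal sketch under stated assumptions: it starts from $t = 1$ with $P(1, n|x) = \delta_{n,x}$ rather than from the experimental distribution at $T_{\min}$, and the response function `f` is passed in by the caller, because the fitted form of Eq. (4) is not reproduced here. The function names are hypothetical.

```python
def evolve(P, t, f):
    """Advance P(t, n|x), n = 0..t, by one step of the master equation.

    Returns the list P(t+1, n|x) for n = 0..t+1.
    """
    new = [0.0] * (t + 2)
    for n, prob in enumerate(P):
        up = f(n / t)                    # probability that X(t+1) = 1
        new[n + 1] += up * prob          # n -> n + 1
        new[n] += (1.0 - up) * prob      # n stays the same
    return new

def correlation(f, t_max):
    """Return [C(1), ..., C(t_max)] for a response function f(z).

    Assumption: the recursion starts at t = 1 with P(1, n|x) = delta_{n,x}.
    """
    P = {0: [1.0, 0.0], 1: [0.0, 1.0]}
    C = []
    for t in range(1, t_max + 1):
        # Summing over all n = 0..t equals the restricted sums in the text,
        # because the excluded terms have zero probability.
        mean = {x: sum(p * f(n / t) for n, p in enumerate(P[x])) for x in (0, 1)}
        C.append(mean[1] - mean[0])
        P = {x: evolve(P[x], t, f) for x in (0, 1)}
    return C

# Illustrative linear response function (hypothetical parameters, not a fit).
C_linear = correlation(lambda z: 0.3 + 0.5 * z, 200)
```

The cost grows as $O(t^2)$ in this naive form, so reaching $t = 10^6$ would require vectorized arithmetic or a coarser representation of $P(t, n|x)$.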

Fig. 9. $C(t)$ vs. $t$ for $10^1 \le t \le 10^6$. For $t < T_{\min}$, we plot the results in Figure 8a with $\Delta t = 10$.

Figure 9 shows the plots of $C(t)$ versus $t$. For $q = 5/9$, $C(t)$ converges to a finite and
positive value, and $c > 0$. For $q = 8/9$, $C(t)$ decays to zero very slowly. For $q = 7/9$, $C(t)$
decays more slowly, and it takes a finite value even for $t \sim 10^6$. From the slope of $C(t)$ there,
we can assume the limit value of $C(t)$ is zero. For $q = 6/9$, the situation is more subtle. If $f(z)$
has a touch point, $C(t)$ decays logarithmically as

$$C(t) \sim c + c' (\ln t)^{-1}.$$

In this case, it is difficult to judge whether the limit value of $C(t)$ is positive or zero, as $C(t)$
decreases too slowly. Even though this case remains uncertain, we can say that $c$ is positive for
$q = 5/9$ and zero for $q = 7/9$ and $8/9$. The system shows a phase transition.

5.4 Estimation of c 

To estimate $c$, we employ the integrated quantities of $C(t)$, which are the integrated correlation
time $\tau$ and the second moment correlation time $\xi$ divided by the time horizon $t$. They
are defined in terms of the moments of $C(s)$ as

$$\tau_t(t) \equiv \tau(t)/t = m_0(t)/t, \qquad \xi_t(t) \equiv \xi(t)/t = \sqrt{m_2(t)/m_0(t)}, \tag{8}$$

$$m_n(t) \equiv \sum_{s=0}^{t-1} C(s)\,(s/t)^n. \tag{9}$$




Fig. 10. Plots of (a) $\tau_t(t)$ and (b) $\xi_t(t)$ vs. $t$.


By using the asymptotic behavior of $C(t)$ in Eq. (5), the limit values of $\tau_t(t)$ and $\xi_t(t)$ are
found to be

$$\lim_{t \to \infty} \tau_t(t) = \lim_{t \to \infty} \left( c + \frac{c'}{l}\, t^{l-1} \right) = c, \tag{10}$$

$$\lim_{t \to \infty} \xi_t(t) = \begin{cases} \sqrt{\dfrac{l}{l+2}}, & c = 0, \\[2mm] \sqrt{\dfrac{1}{3}}, & c > 0. \end{cases} \tag{11}$$

The limit value of $\tau_t(t)$ coincides with $c$. With the limit value of $\xi_t(t)$, we can judge whether
$c > 0$ or $c = 0$ by $\lim_{t \to \infty} \xi_t(t) = \sqrt{1/3}$ or $\lim_{t \to \infty} \xi_t(t) < \sqrt{1/3}$.
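
The integrated quantities in Eqs. (8) and (9) are straightforward to evaluate from a sequence $C(0), \ldots, C(t-1)$. The following minimal sketch (function names are hypothetical) computes $\tau_t(t)$ and $\xi_t(t)$ and applies them to two synthetic inputs chosen here only for illustration: a constant $C(s)$, which mimics $c > 0$, and a pure power law $C(s) = (s+1)^{l-1}$ with $l = 1/2$, which mimics $c = 0$.

```python
import math

def moment(C, n):
    """m_n(t) = sum_{s=0}^{t-1} C(s) (s/t)^n, with C = [C(0), ..., C(t-1)]."""
    t = len(C)
    return sum(c * (s / t) ** n for s, c in enumerate(C))

def tau_xi(C):
    """Return (tau_t(t), xi_t(t)) as defined in Eq. (8)."""
    t = len(C)
    m0 = moment(C, 0)
    m2 = moment(C, 2)
    return m0 / t, math.sqrt(m2 / m0)

# Synthetic inputs (assumptions for illustration, not experimental data).
tau_pos, xi_pos = tau_xi([0.3] * 10000)                       # constant C(s): c = 0.3
tau_zero, xi_zero = tau_xi([(s + 1) ** -0.5 for s in range(20000)])  # power law: c = 0
```

In the first case $\xi_t(t)$ lies close to $3^{-1/2}$, and in the second it lies close to $\sqrt{l/(l+2)}$, below $3^{-1/2}$, which is the criterion used to distinguish the two phases.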

Figure 10 shows plots of $\tau_t(t)$ and $\xi_t(t)$ versus $t$. $\tau_t(t)$ increases gradually with $t$ for $t > T_{\min}$
and $q = 5/9$. For sufficiently large $t$, $\tau_t(t)$ for $q = 5/9$ is larger than that for $q = 6/9$. For
$q = 7/9$ and $8/9$, $\tau_t(t)$ decreases to zero monotonically, suggesting that $c = 0$. $\xi_t(t)$ for large
$t$ is smaller than $3^{-1/2}$ for $q \in \{7/9, 8/9\}$, also suggesting that $c = 0$. For $q \in \{5/9, 6/9\}$, $\xi_t(t)$
converges to $3^{-1/2}$ as $t$ increases, suggesting that $c > 0$. From these results, we conclude that
$c$ decreases with increasing $q$ for $q \in \{5/9, 6/9\}$ and $c = 0$ for $q \in \{7/9, 8/9\}$.


5.5 Plot of $P(t, n|x = 0)$

Lastly, we show the time evolution of $P(t, n|x)$ for the sample with $X(q, i, 1) = x = 0$. The
boxplot of $Z(q, i, T_{\min})$ for $x = 0$ in Figure 3 shows the initial configuration for $P(T_{\min}, n|0)$.
As there is only one stable fixed point, $z_+$, for $q \in \{7/9, 8/9\}$, $Z(q, i, t)$ should converge to
$z_+$. The main interest lies in whether the samples with $Z(q, i, T_{\min}) < 1/2$ for $q \in \{7/9, 8/9\}$
converge to $z_+$. On the other hand, for $q \in \{5/9, 6/9\}$, there are two stable fixed states, and
$P(t, n|0)$ should have two peaks.

Figure 11 shows plots of $P(t, n|0)$ for $q \in \{5/9, 6/9, 7/9, 8/9\}$ and $t \in \{T_{\min}, 10^4, 10^6\}$.
$P(t, n|0)$ for $q \in \{5/9, 6/9\}$ clearly has two peaks for $t = 10^6$. However, there is also a clear
difference in the convergence of $P(t, n|0)$. For $q = 5/9$, the peak at the lower stable fixed
point $z_-$ is sharp for $t = 10^6$, suggesting that the convergence is rapid. On the other hand,
for $q = 6/9$, the height of the peak at the touchpoint is low, suggesting slow convergence.
If $f(z)$ has a touchpoint at $q_t$, $Z(q, i, t)$ converges to $q_t$ as $|q_t - Z(q, i, t)| \propto (\ln t)^{-1}$ if $Z(q, i, t)$
starts below $q_t$. This slow convergence is reflected in the shape of the peak at $q_t$. For $q = 8/9$,
only one peak appears, and the sample with $Z(q, i, T_{\min}) < 1/2$ converges to $z_+$ at $t = 10^6$.
For $q = 7/9$, as the deviation of $f(z)$ from the diagonal is small, the convergence to the
unique stable fixed point $q_+$ is remarkably slow. Even at $t = 10^6$, a positive probability of
$Z(q, i, t) < 1/2$ remains. In the limit $t \to \infty$, the probability should disappear, but this is difficult
to detect experimentally.

6. Summary and Comments 

Fig. 11. Plots of $P(t, n|x)$ with $x = X(q, i, 1) = 0$ for $q \in \{5/9, 6/9, 7/9, 8/9\}$ and $t \in \{T_{\min}, 10^4, 10^6\}$.

We propose a new method of detecting a phase transition in a nonlinear Polya urn in an
information cascade experiment. It is based on the asymptotic behavior of the correlation
function $C(t) \sim c + c' \cdot t^{l-1}$. The limit value $c$ of $C(t)$ is the order parameter of the phase
transition. The phase transition is between the phase with $c = 0$, in which there is only one
stable state, and the phase with $c > 0$, in which there is more than one stable state. To estimate
$c$ and detect the phase transition, we propose to use the correlation times $\tau(t)$ and $\xi(t)$ divided
by $t$. We perform an information cascade experiment to verify the method. The experimental
setup is the canonical one in which subjects guess whether the randomly chosen urn X is urn
A or urn B. We control the precision of the private signal $q$ by changing the configuration of
colored balls in the urns. We successfully detected the phase transition in the system when $q$
changed. For large $q$, $c = 0$, and there is only one stable state. The system is self-correcting.
For small $q$, $c > 0$, and there are multiple stable states. The probability that the majority’s
choice is incorrect is positive.

We comment on the system size in the experiment. In this paper, we reported on two
experiments, EXP-I and EXP-II, which differ mainly in the system size $T$ and sample number
$I$. Regarding the system size $T$, as Cor($S(t)$, $X(t)$) and Cor($Z(t-1)$, $X(t)$) fluctuate around some
value for $t > 15$, $T$ should be larger than this threshold in order to study the
stationary behavior of the system. Furthermore, to estimate $c$ from the asymptotic behavior
of $C(t)$, it is necessary to estimate $f(z)$ precisely. For this purpose, $Z(q, i, t)$ should take all
the values in $[0, 1]$. As $t$ increases, $Z(q, i, t)$ converges to some stable fixed point of $f(z)$. We
cannot gather enough data to cover all the values $z \in [0, 1]$ if $t$ becomes too large. Instead of
setting $T$ to be large, we should set $I$ to be large. In EXP-I, we judge that there is only one
stable fixed point for $q = 8/9$. The difficulty of determining phases comes from the error bars
in the estimate of $f(z)$. As the error bars $\Delta f(z)$ are proportional to $1/\sqrt{I}$, $I$ should be as large
as possible to reduce $\Delta f(z)$. Considering the standard errors $\Delta f(z)$ in Figure 7, in order to
judge whether there is only one stable fixed point for $f(z)$ for $q = 8/9$ in EXP-I, $I$ should be
four times that in EXP-I. Although $I = 4 \times 400 = 1.6 \times 10^3$ might be large for a laboratory
experiment, it is realizable in a web-based online experiment. 4, 5
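
The effect of quadrupling the sample number follows directly from the $1/\sqrt{I}$ scaling of the error bars. In the notation above (the subscripts on $\Delta f$ are introduced here only for this illustration), the standard error is halved:

```latex
\Delta f(z) \propto \frac{1}{\sqrt{I}}
\quad\Longrightarrow\quad
\frac{\Delta f(z)\big|_{4I}}{\Delta f(z)\big|_{I}} = \sqrt{\frac{I}{4I}} = \frac{1}{2}.
```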

Another future problem is to understand and derive the response function theoretically. A
theoretical investigation using experimental data for an information cascade in a two-choice
general knowledge quiz was recently performed. 24 The problem in analyzing the data for
an information cascade in a general knowledge quiz is the difficulty in controlling the private
signal. 21 The information cascade experiment with a two-choice urn is ideal from this
viewpoint. The experimenter can control the private signal freely and study the change in the
subjects’ choices. To understand the response function, it is necessary to control the number
of referenced subjects. We believe that experiments along these lines should be performed.
The multi-choice quiz case might be an interesting experimental subject. In that case, the
corresponding nonlinear Polya model is similar to the Potts model. 25 The problem is whether
the herding strength increases or decreases as the number of options changes. We believe that
the accumulation of experimental studies in these directions is important for the development
of econophysics 26-28 and sociophysics. 29, 30
Appendix A:

We denote the probability function for $N(t) \in \{-2, -1, 0, 1, 2\}$ with the initial condition
$X(1) = x$ as

$$P_n(t|x) \equiv \Pr(N(t) = n \,|\, X(1) = x).$$

$P_0(2n|x)$ is easily estimated as

$$P_0(2n|x) = P_0(2|x) \cdot (2q(1-q))^{n-1}.$$

$P_{\pm 2}(t)$ satisfies the following recursive relations for even $t$,

$$P_2(2n+2|x) = P_2(2n|x) + q^2 \cdot P_0(2n|x),$$
$$P_{-2}(2n+2|x) = P_{-2}(2n|x) + (1-q)^2 \cdot P_0(2n|x). \tag{A·1}$$

$P_{\pm 1}(t) = 0$ for even $t$. For odd $t$, $P_n(t)$ are estimated as

$$P_2(2n+1|x) = P_2(2n|x), \qquad P_{-2}(2n+1|x) = P_{-2}(2n|x),$$
$$P_1(2n+1|x) = q \cdot P_0(2n|x), \qquad P_{-1}(2n+1|x) = (1-q) \cdot P_0(2n|x). \tag{A·2}$$

$P_0(t) = 0$ for odd $t$. The initial condition for the recursive relation is

$$P_0(2|x) = q \cdot \delta_{x,0} + (1-q) \cdot \delta_{x,1}, \qquad P_2(2|x) = q \cdot \delta_{x,1}, \qquad P_{-2}(2|x) = (1-q) \cdot \delta_{x,0}.$$

By solving the recursive relations with the initial condition, we have

$$P_2(2n|x) = P_2(2|x) + q^2 P_0(2|x)\, \frac{1 - (2q(1-q))^{n-1}}{1 - 2q(1-q)},$$
$$P_{-2}(2n|x) = P_{-2}(2|x) + (1-q)^2 P_0(2|x)\, \frac{1 - (2q(1-q))^{n-1}}{1 - 2q(1-q)}. \tag{A·3}$$

The unconditional probability for an up cascade is

$$P_2(2n) = q \cdot P_2(2n|1) + (1-q) \cdot P_2(2n|0) = q^2 + q^2 \cdot 2q(1-q)\, \frac{1 - (2q(1-q))^{n-1}}{1 - 2q(1-q)}.$$

In the limit $n \to \infty$, it converges to

$$P_2(\infty) \equiv \lim_{n \to \infty} P_2(2n) = \frac{q^2}{q^2 + (1-q)^2}.$$


$\Pr(X(2n+1) = 1 \,|\, X(1) = x)$ is then estimated as

$$\Pr(X(2n+1) = 1 \,|\, X(1) = x) = P_2(2n|x) + q \cdot P_0(2n|x).$$


$C(2n)$ is then given as

$$C(2n) = \frac{q(1-q)}{q^2 + (1-q)^2} + \frac{(1-2q)^2}{2\,(q^2 + (1-q)^2)} \left( \sqrt{2q(1-q)} \right)^{2n}.$$


For t = 2n + 1, we can show that C(2n + 1) = C(2n). 
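
The recursive relations (A·1) and the initial condition can be iterated numerically as a check on the algebra. The following is a minimal sketch; the function names are hypothetical, and $C(2n)$ is evaluated as $\Pr(X(2n+1) = 1|X(1) = 1) - \Pr(X(2n+1) = 1|X(1) = 0)$ using the expression for $\Pr(X(2n+1) = 1|X(1) = x)$ given above.

```python
def cascade_probs(q, x, n_max):
    """Iterate (A.1) from the initial condition at t = 2.

    Returns a list of tuples (P_0(2n|x), P_2(2n|x), P_-2(2n|x)) for n = 1..n_max.
    """
    p0 = q if x == 0 else 1.0 - q        # P_0(2|x)
    p2 = q if x == 1 else 0.0            # P_2(2|x)
    pm2 = (1.0 - q) if x == 0 else 0.0   # P_-2(2|x)
    out = [(p0, p2, pm2)]
    for _ in range(n_max - 1):
        p2 += q * q * p0                 # two successive up choices from N = 0
        pm2 += (1.0 - q) ** 2 * p0       # two successive down choices from N = 0
        p0 *= 2.0 * q * (1.0 - q)        # one up and one down: back to N = 0
        out.append((p0, p2, pm2))
    return out

def correlation_even(q, n):
    """C(2n) from Pr(X(2n+1) = 1 | X(1) = x) = P_2(2n|x) + q * P_0(2n|x)."""
    pr = {}
    for x in (0, 1):
        p0, p2, _ = cascade_probs(q, x, n)[-1]
        pr[x] = p2 + q * p0
    return pr[1] - pr[0]

# Example: values of C(2n) for q = 2/3 and n = 1..10.
values = [correlation_even(2.0 / 3.0, n) for n in range(1, 11)]
```

The three probabilities returned at each step sum to one, which is a convenient sanity check on the recursion.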

Appendix B: Additional information about EXP-I 

We explain EXP-I in detail. We performed the experiment in 2013, 2014, and 2015. We 
recruited 126, 109, and 121 subjects in 2013, 2014, and 2015, respectively. 

In 2013, the duration of the experiment was 13 days; we recruited 126 subjects and performed
the experiment for $q \in \{5/9, 6/9\}$. Subjects had to participate in the experiment twice.
In the first session, subjects answered 100 questions for $q = 6/9$. After a 5 min interval, they
participated in another cascade experiment. In one session, a subject had to participate in two
types of information cascade experiment. In the second session, subjects answered 100 questions
for $q = 5/9$. After a 5 min interval, they participated in another cascade experiment.
The allotted time for one session was 90 min, which included time for an explanation of the
experiment. The subjects received 10 yen (about 8 cents) for each correct choice. After they
participated in two sessions for two values of $q$, they were given their reward.

We performed the experiment for q = 7/9 in 2014. The duration of the experiment was 
13 days, and we recruited 109 subjects. Thirty-nine of the subjects had participated in the 
experiment in 2013. As in the experiment in 2013, after they answered 100 questions for 
q = 7/9, they participated in another cascade experiment. A problem with the web server 
used for the experiment occurred on the first day in 2014, and some participants could not 
answer all 100 questions in the allotted time. The subjects received 5 yen (about 4 cents) for 
each correct choice. 

In 2015, we performed the experiment for $q = 8/9$. The duration of the experiment was 7
days, and we recruited 121 subjects. In the experiment, the subjects answered 200 questions
for $q = 8/9$ only, and they did not participate in another experiment. Ten of the subjects had
participated in both of the first two experiments. Within the allowed time of about 40 min,
they could not answer all questions. The subjects received 5 yen (about 4 cents) for each
correct choice in addition to a payment of 150 yen (about 1.2 dollars) for participating.

J. Phys. Soc. Jpn.

Next, we explain the experimental procedure. Subjects entered a room and sat in a seat.
There were two documents on the desk in front of the seat: an experimental participation
consent document and a brief explanation of the experiment. The experimenter described the
experiment and the reward using the document. Next, the subjects signed the consent document
and logged into the experiment’s web site using IDs assigned by the experimenter. Then
they started to answer the questions. After the experiment started, communication among
participants was forbidden. A question was chosen by the server used for the experiment and
displayed on the monitor of a 7 in. tablet (e.g., Nexus 7). There were no partitions in the
room, and subjects could see each other. However, the displays on the tablets were small, and
the subjects could not see which question the other subjects received and which option they
chose.

Appendix C: Additional information about EXP-II 

We recruited 33 subjects for EXP-II. We performed the experiment in one day. Originally,
we planned to obtain data for the experiment with $T = 33$ and $Q = \{6/9, 5/9, 8/15\}$ twice
within 3 h. We prepared $I = 33$ questions and the private signals $U(q, i, t)$ for $T$ subjects
for question $q, i$. We let all 33 subjects enter an information science laboratory, and they
participated in the experiment simultaneously. Subject $j = 1, \cdots, 33$ answered question $i =
1, \cdots, I$ as the $t = (i + j - 2) \bmod 33 + 1$-th subject. However, this procedure caused a “traffic
jam,” and the server used for the experiment could not serve questions smoothly. Within the
3 h allotted, we could gather data only for the first three cases, i.e., 99 questions. Subjects
received 10 yen (about 8 cents) for each correct choice. There was a payment of 3000 yen
(about 25 dollars) for participating.
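
The rotation rule $t = (i + j - 2) \bmod 33 + 1$ assigns every subject a different position in each question's sequence. A minimal sketch (the function name `position` is hypothetical) makes the resulting schedule explicit:

```python
def position(i, j, T=33):
    """1-based position t at which subject j answers question i."""
    return (i + j - 2) % T + 1

# schedule[i-1][j-1] is the position of subject j in the sequence for question i.
schedule = [[position(i, j) for j in range(1, 34)] for i in range(1, 34)]
```

With $I = T = 33$, each row and each column of `schedule` is a permutation of $1, \cdots, 33$: every question is answered once at each position, and every subject occupies each position exactly once.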

Appendix D: Asymptotic behavior of $V(Z(q, t))$

We studied the asymptotic behavior of the variance of $Z(q, t)$ and verified the possibility of
the phase transition. In contrast to the method based on $C(t)$, the analysis of the variance has
the advantage that it can directly detect the existence of multiple stable states. The drawback
is the estimation of the standard errors, as we do not know the distribution of $Z(q, t)$.

Figure D·1 shows plots of $V(Z(q, t))$ versus $t$. For $q \in \{5/9, 6/9\}$ in EXP-I and for all
cases in EXP-II, $V(Z(q, t))$ seems to converge to some positive value for large $t$. The result
is consistent with the result that there are multiple stable states in the system in these cases.
$V(Z(q, t))$ exhibits power-law behavior as $V(Z(q, t)) \propto t^{l-1}$ with $l = 0.758(5)$ and $0.662(4)$
for $q = 7/9$ and $q = 8/9$, respectively. There is only one stable state in the system in these cases. The
asymptotic behavior of $V(Z(q, t))$ and that of $C(t)$ is the same if $l > 1/2$. 15

Fig. D·1. $V(Z(q, t))$ vs. $t$ in (a) EXP-I and (b) EXP-II. Solid line in (a) shows the results fitted with
$V(Z(q, t)) \propto t^{l-1}$ and $l = 0.758$.

Appendix E: Archive of experimental data 

On the arXiv site for this manuscript, we uploaded the experimental data for both experiments.
The data are provided as CSV files, EXP-I.csv and EXP-II.csv. They contain
$X(q, i, t)$, $S(q, i, t)$, $ID(q, i, t)$, and $C(q, i, t)$ for $q \in Q$, $i \in \{1, \cdots, |I|\}$, and $t \in \{1, \cdots, T\}$.
Here $C(q, i, t) \in \{50\%, 60\%, \cdots, 100\%\}$ indicates the confidence of the subject regarding the
choice $X(q, i, t)$. In EXP-II, the subject chose A or B directly instead of in terms of the confidence
level, so there are no data for the confidence. $ID(q, i, t)$ are the identification numbers
of the subjects. In EXP-II, $ID \in \{1, \cdots, 33\}$, as there were 33 subjects. In EXP-I, in 2013,
there were 126 subjects, and we labeled them as $ID \in \{1, \cdots, 126\}$. In 2014, there were 109
subjects, 39 of whom had participated in the first period. We used the same IDs for these 39
subjects and labeled the remaining 70 subjects as $ID \in \{127, \cdots, 196\}$. In 2015, there were
121 subjects, 10 of whom participated in both experiments in 2013 and 2014. We labeled the
remaining 111 subjects as $ID \in \{197, \cdots, 307\}$.

The first column in the data file is $n$ in $q = n/(n + m)$, the second column is $i$, the third
column is $t$, the fourth column is $X(q, i, t)$, the fifth column is $S(q, i, t)$, the sixth column is
$ID(q, i, t)$, and the last column is $C(q, i, t)$.
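
A reader who wishes to work with these files might load them along the following lines. This is a minimal sketch under assumptions that are not stated in the text: the files are taken to be comma-separated without a header row, and only $n$, $i$, and $t$ are converted to integers because the encoding of the other columns is not specified. The function names and the sample rows are hypothetical.

```python
import csv
import io

# Column order as described in the text.
COLUMNS = ("n", "i", "t", "X", "S", "ID", "C")

def read_records(handle):
    """Parse rows laid out as n, i, t, X, S, ID, C (assumed: no header row)."""
    records = []
    for row in csv.reader(handle):
        if not row:
            continue
        # zip() stops at the shorter sequence, so rows without a confidence
        # column simply lack the "C" key.
        rec = dict(zip(COLUMNS, (field.strip() for field in row)))
        for key in ("n", "i", "t"):
            rec[key] = int(rec[key])
        records.append(rec)
    return records

def sequences(records):
    """Group the choices X by (n, i) and order each group by t."""
    grouped = {}
    for rec in records:
        grouped.setdefault((rec["n"], rec["i"]), []).append((rec["t"], rec["X"]))
    return {key: [x for _, x in sorted(vals)] for key, vals in grouped.items()}

# Hypothetical sample rows, invented for illustration (not taken from the data files).
sample = io.StringIO("5,1,2,0,0,12,60%\n5,1,1,1,1,7,80%\n6,1,1,1,0,3,50%\n")
seqs = sequences(read_records(sample))
```

For the actual files, `io.StringIO(...)` would be replaced by an opened file such as `open("EXP-I.csv", newline="")`, after checking the delimiter and whether a header row is present.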